A survey of sparse matrix-vector multiplication performance on large matrices

نویسندگان

  • Max Grossman
  • Christopher Thiele
  • Mauricio Araya-Polo
  • Florian Frank
  • Faruk O. Alpak
  • Vivek Sarkar
چکیده

One of the main sources of sparse matrices is the discretization of partial differential equations that govern continuumphysics phenomena such as fluid flow and transport, phase separation, mechanical deformation, electromagnetic wave propagation, and others. Recent advances in high-performance computing area have been enabling researchers to tackle increasingly larger problems leading to sparse linear systems with hundreds of millions to a few tens of billions of unknowns, e.g., [5, 6]. Iterative linear solvers are popular in large-scale computing as they consume less memory than direct solvers. Contrary to direct linear solvers, iterative solvers approach the solution gradually requiring the computation of sparse matrixvector (SpMV) products. The evaluation of SpMV products can emerge as a bottleneck for computational performance within the context of the simulation of large problems. In this work, we focus on a linear system arising from the discretization of the Cahn–Hilliard equation, which is a fourth order nonlinear parabolic partial differential equation that governs the separation of a two-component mixture into phases [3]. The underlying spatial discretization is performed using the discontinuous Galerkin method (e.g. [10]) and Newton’s method. A number of parallel algorithms and strategies have been evaluated in this work to accelerate the evaluation of SpMV products.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Investigating the Effects of Hardware Parameters on Power Consumptions in SPMV Algorithms on Graphics Processing Units (GPUs)

Although Sparse matrix-vector multiplication (SPMVs) algorithms are simple, they include important parts of Linear Algebra algorithms in Mathematics and Physics areas. As these algorithms can be run in parallel, Graphics Processing Units (GPUs) has been considered as one of the best candidates to run these algorithms. In the recent years, power consumption has been considered as one of the metr...

متن کامل

Transposition Mechanism for Sparse Matrices on Vector Processors

Many scientific applications involve operations on sparse matrices. However, due to irregularities induced by the sparsity patterns, many operations on sparse matrices execute inefficiently on traditional scalar and vector architectures. To tackle this problem a scheme has been proposed consisting of two parts: (a) An extension to a vector architecture to support sparse matrix-vector multiplica...

متن کامل

A Survey on Performance Modelling and Optimization Techniques for SpMV on GPUs

Sparse Matrix is a matrix consisting of very few non-zero entries. Large sparse matrices are often used in engineering and scientific operations. Especially sparse-matrix vector multiplication is an important operation for solving linear system and partial differential equations. However, there is a possibility that even though the matrix is partitioned and stored appropriately, the performance...

متن کامل

Performance Comparison of Storage Formats for Sparse Matrices

The sparse data structure represents a matrix in space proportional to the number of non-zero entries. Many storage formats have been proposed to represent sparse matrices. In this paper we evaluate and compare the storage efficiency of various sparse matrix storage formats, and consider the performance results of matrix-vector multiplication using these storage formats.

متن کامل

Blocked-based sparse matrix-vector multiplication on distributed memory parallel computers

The present paper discusses the implementations of sparse matrix-vector products, which are crucial for high performance solutions of large-scale linear equations, on a PC-Cluster. Three storage formats for sparse matrices compressed row storage, block compressed row storage and sparse block compressed row storage are evaluated. Although using BCRS format reduces the execution time but the impr...

متن کامل

Performance optimization of Sparse Matrix-Vector Multiplication for multi-component PDE-based applications using GPUs

Simulations of many multi-component PDE-based applications, such as petroleum reservoirs or reacting flows, are dominated by the solution, on each time step and within each Newton step, of large sparse linear systems. The standard solver is a preconditioned Krylov method. Along with application of the preconditioner, memory-bound Sparse Matrix-Vector Multiplication (SpMV) is the most time-consu...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1608.00636  شماره 

صفحات  -

تاریخ انتشار 2016